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Sunimary:  An  axiomatic  study  is  made  of 

various  criteria  for  playing  games  against 
Nature;  and  a  new  criterion  is  suggested. 
The  concept  of  equilibrium  point  is  de¬ 
fined  for  n-person  games  against  Nature. 


GAMES  AGAINST  NATURE 

John  Milnor 


Introduction. 

We  wish  to  study  games  of  the  following  type.  A  matrix  is 
given  in  which  a  player  must  choose  a  row.  A  column  will  be  chosen 
by  "Nature",  whose  payoff  is  unknown  or  identically  zero.  The  pay¬ 
off  to  the  player  is  given  by  the  entry  in  that  particular  row  and 
column.  This  entry  should  represent  a  von  Neumann-Morgenstern 
utility.  It  is  of  course  also  interesting  to  study  the  case  of  an 
infinite  game,  but  this  will  not  be  attempted  here. 

The  following  criteria  have  been  suggested  for  this  situation. 
According  to  Laplace,  if  the  relevant  probabilities  are  unknown, 
we  should  assume  that  they  are  equal.  In  other  words,  the  player 
should  choose  the  row  for  which  the  average  of  the  elements  is 
greatest . 

According  to  the  minimax  principle  which  is  associated  with 
Wald,  we  should  assume  that  the  payoff  to  Nature  is  the  negative 
of  the  payoff  to  the  player.  The  solution  is  then  just  the  solution 
of  the  matrix  considered  as  the  matrix  of  a  zero  sum  game. 

A  modification  of  this  criterion  was  suggested  by  Hurwicz. 
Select  a  constant  0  <  <  1  which  measures  the  player^s  optimism. 

For  each  row  or  probability  mixture  of  rows  let  m  denote  the 
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sraallest  component  and  M  the  largest..  According  to  Hurwicz  we 
should  select  one  of  those  rows,  or  mixtures  of  rows,  for  which 
c^M  +  (l-c<)m  attains  its  maximum.  For  o<=  0  this  reduces  to 
the  Wald  criterion. 

A  different  modification  of  the  minimax  principle  is  given  by 
Savage,  who  holds  that  the  payoff  to  Nature  should  be  the  "regret”, 
"loss",  or  "miss"  which  is  measured  by  the  maximum  of  the  elements 
of  a  column  minus  the  particular  entry.  In  other  words,  the  regret 
is  the  difference  between  what  could  have  been  gotten  if  the  state 
of  Nature  were  known  and  what  actually  was  gotten.  A  strategy  is 
to  be  chosen  as  if  the  regret  matrix  were  that  of  a  zero  sum  game. 

The  present  paper  will  attempt  to  study  these  criteria  and 
suggest  others  on  the  basis  of  an  axiomatic  approach. 


,$I.  Axioms  for  the  Laplace,  Wald,  and  Hurwicz  criteria. 

It  will  be  convenient  to  make  no  distinction  between  rows  and 
probability  combinations  of  rows.  We  will  first  consider  criteria 
which  assign  a  preference  relation  >  between  pairs  of  rows,  or 
between  pairs  of  probability  combinations  of  rows,  satisfying  the 
following  axioms. 

1.1  The  relation  >  is  a  complete  ordering.  In  other  words, 
the  following  two  laws  are  satisfied. 

1.1a  For  any  r  and  r*  either  r  >  rf  or  r'  >  r. 

1.1b  If  r  >  r'  >  r"  then  r  >  r". 

1.2  The  order  of  two  rows  is  not  changed  by  the  adjunction 
of  a  new  row. 
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1.3  It  is  not  changed  by  any  permutation  of  the  rows  or 
columns. 

1.4  If  each  element  of  r  is  greater  than  the  corresponding 
element  of  r'  then  r  >  rT. 

The  first  of  these  axioms  is  somewhat  stronger  than  is  neces¬ 
sary,  since  we  only  need  to  know  the  set  of  optimal  strategies  and 
not  the  complete  ordering  relation.  Axiom  1.2  seems  much  too 
strong  to  me.  Part  of  my  object  in  writing  section  I  is  to  show 
that  this  axiom  is  incompatible  with  others  which  seem  more  funda¬ 
mental  to  me.  The  criterion  of  Savage  violates  this  axiom  and 
therefore  is  not  considered  in  this  section.  Axiom  1.3  is  actually 
a  part  of  the  description  of  the  problem.  That  is,  we  are  not 
considering  games  in  which  there  is  any  reason  to  believe  that  one 
column  is  more  probable  than  another.  Axiom  1.4  has  been  put  in  a 
weaker  form  than  is  usual  since  the  Wald,  Hurwicz,  and  Savage  cri¬ 
teria  do  not  satisfy  the  stronger  condition  that  r  >  r'  whenever 
r  dominates  r'. 

Some  of  the  following  axioms  may  also  be  imposed. 

1.5  The  order  of  the  rows  is  not  changed  if  a  constant  is 
added  to  a  column. 

1.6  It  is  not  changed  if  a  new  column  c’  is  added,  if 

C  -  a,c,  +  •••  -  ancn  where  at  >  0,  a,  +  •••  +  an  -  1. 

1.7  It  is  not  changed  if  each  element  P.  .  of  the  matrix  is 

ij 

replaced  by  Xpij  +  A  >  0. 

1.3  The  ordering  relation  is  continuous  in  the  following  sense. 
If  a  sequence  of  matrices  P^  converges  to  P  and  if  for  all  i, 


RM— 679 


-4- 


r.  >  r! ,  then  the  limit  rows  r  and  r'  satisfy  r  >  r' . 

1.9  The  set  of  optimal  strategies  is  convex. 

Axiom  1.5  is  in  part  a  statement  that  there  is  no  reason  to 
believe  that  nature  is  against  us.  It  is  also  a  statement  that 
the  utility  is  linear,  not  only  with  respect  to  known  probabilities, 
but  also  with  respect  to  unknown  probabilities  of  the  type  under 
consideration.  It  is  satisfied  by  the  Laplace  and  Savage  criteria. 
Axiom  1.6  is  satisfied  by  all  but  the  Laplace  criterion,  while  1.7 
and  1.8  are  satisfied  by  all  four.  Axiom  1.9  states  that  it  is 
irrational  to  be  equally  willing  to  play  two  alternatives  and  yet 
not  be  willing  to  randomize  between  them.  It  is  violated  by  the 
Hurwicz  criterion  for  cK  >  0  (in  the  matrix  ^  q  for 

example) . 

The  following  assertions  will  be  proved  in  the  appendix. 

Axioms  1.1  through  1.4  are  implicitly  assumed. 

1.10  If  two  rows  differ  only  in  the  order  of  their  elements, 
then  they  are  equivalent. 

1.11  The  only  criterion  satisfying  1.5  is  that  of  Laplace. 

1.12  For  any  criterion  satisfying  1.6,  two  rows  having  the 
same  maximum  and  minimum  elements  are  equivalent. 

1.13  Axioms  1.6,  1.7,  and  1.8  imply  the  Hurwicz  criterion. 

1.14  Axioms  1.6,  1.8,  and  1.9  imply  the  Wald  criterion. 

£11.  Alternative  Axioms. 

Since  the  nine  axioms  considered  in  section  I  have  been  shown 
to  be  inconsistent,  it  is  natural  to  try  and  pare  them  down  to  a 
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set  of  more  fundamental  axioms.  The  following  is  a  list  of  those 
axioms  which  I  personally  feel  to  be  most  important.  Others  would 
doubtlessly  set  up  completely  different  lists.  Let  S  denote  the 
simplex  of  mixed  strategies  over  the  rows. 

2.1  There  is  a  choice  set  C(P^j)  which  is  contained  in  S. 

2.2  This  set  depends  continuously  on  P.  .  in  the  following 

sense.  If  approaches  P^,  xic€C^Pi<jk^  for  each  k»  and 

x,  approaches  x,  then  x<£C(P.  .).  In  particular  C  is  a  closed 
set . 

2.3  C  is  convex. 

2.L  The  choice  set  is  invariant  under  permutations  of  the 
rows  or  columns. 

2.5  Every  element  of  C  is  admissible  (undominated). 

2.6  The  choice  set  is  not  changed  if  a  new  column,  identical 
with  one  of  the  old,  is  added. 

These  axioms  are  not  compatible  with  any  of  the  criteria  under 
consideration.  That  of  Laplace  violates  2.6,  while  those  of  Wald, 
Hurwicz,  and  Savage  violate  2.5.  It  is  of  course  possible  to  inter¬ 
sect  the  choice  set  of  Wald,  Hurwicz,  or  Savage  with  the  set  of 
admissible  strategies,  and  thus  obtain  a  new  criterion.  The  result 
violates  2.2  however.  The  Hurwicz  criterion  also  violates  axiom 
2.3,  providing  that  °<  >  0 . 

As  an  example  of  the  discrepancy  between  these  axioms  and  the 
four  criteria  consider  the  following  family  of  games,  where 
0  <  k  <  1. 
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1 

0 

k 


0 

k 

0 


+*  Vi 

Let  denote  the  probability  of  playing  the  i  row  for 

i  =  1,  2,  3,  4.  For  k=l,  each  of  the  first  two  rows  is  dominated 
by  one  of  the  last  two.  For  any  criterion  satisfying  2.5,  there¬ 
fore,  each  point  of  the  choice  set  must  satisfy  a3  +  a4  =  1.  By 
continuity  a3  +  a4  must  be  close  to  one  for  k  close  to  one. 
Solutions  by  the  four  criteria,  and  graphs  of  these  solutions 
follow.  It  may  be  that  a  player  who  uses  the  Savage  criterion  will 
take  the  symmetries  of  the  matrix  into  account  and  consider  only 
symmetrical  strategies  (that  is  strategies  with  a,  =  aa,  a3  =  a4). 
The  solution  in  this  case  is  also  included  below,  since  it  illus^ 
trates  the  dependence  of  the  Savage  solution  on  the  set  of  strat¬ 
egies  considered. 
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Criterion 

Game 

a_2 

a4 

Admissibility 

k<  1 

.  .  anything  .  .  . 

k  =  l 

0 

0 

a  3  + 

a4  =  1 

Laplace 

k=0 

.  .  anything  .  .  . 

0<  k 

0 

0 

a3  + 

a4  =  1 

Wald 

and 

k<  1 

1 

2 

1 

2 

0 

0 

Savage 

k=l 

•  •  *  cl  ]  “ 

a  z  •  • 

• 

•  •  *  3.  ^  ~ 

3^  •  •  • 

Savage 

Symmetrical 

1-k 

1-k 

l 

l 

Solutions  Only 

2  ( 2-  k ) 

2  ( 2— k ) 

2 (2-k) 

2 (2-k) 

Hurwicz 

k<  1-2  ex' 

1 

1 

0 

0 

lX  <  1 

£ 

£ 

1  >k  >1-2  ’ 

0 

k 

1 

0 

i  •  •  « 

k 

1+k 

•  •  t  • 

u 

or 

1+k 

o 

•  •  • 

1 

1+k 

1+k 

k=l-2  ck 

o 

1  A 

n 

K 

1  A 

fOlM 

a! +k-ka} 

l-2a, 

0 

1+k 

•  •  •  * 

or 

1+k 

•  •  • 

•  *  •  • 

a^+k-ka^ 

a2— 

1 

0 

l-2a^ 

1+k 

2 

1+k 

k=l 

0 

a2+a4~ 

1 

2 

1 

2 

•  •  •  • 

•  •  •  • 

or 

•  *  • 

1 

1 

al+a3~  2 

0 

2 

H 

11 

ft 

O,  =  0  or 

ck2  = 

0 
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Other  Criteria. 

A  general  class  of  decision  criteria  related  to  that  of 
Laplace  are  those  which  construct  a  linear  "utility"  function  on 
the  simplex  3.  The  choice  set  is  then  the  subsimplex  of  points 
which  maximize  this  function. 

Since  we  are  only  interested  in  differences  of  utility,  there 
is  no  need  to  distinguish  between  functions  which  differ  by  a 
constant.  This  identification  of  functions  is  best  accomplished 
by  considering  changes  in  strategy  instead  of  fixed  strategies. 

For  any  linear  function  u  and  change  of  strategy  ZA  s  the 
function 

uT(As)  =  u(s+As)  -  u(s) 

is  independent  of  s,  and  does  not  change  if  a  constant  is  added 
to  u.  :.‘/e  wish,  therefore,  to  study  linear  function  which  are 
defined  for  all  ZL  s  and  which  vanish  for  Za  s=0 .  Let  ZA  denote 
the  space  of  strategy  changes  ZAs.  If  an  element  of  3  is  rep¬ 
resented  by  coordinates  (a-j,  ...,  a  ),  +  •••  +  am  =  1 , 

ai  A  then  an  element  of  ZA.  is  represented  by  (  ^  ,  ...,  ^m) 

,  +  •••+  o<  =  0,  such  that  ^  .  +  •  •  •  +  ex’.  <  1  for  all 

m  •  ij.  ik- 

sets  of  distinct  i, ,  ...,  i,  .  The  set  of  linear  functions  on 
/\  which  vanish  at  the  origin  form  a  vector  space  which  may  be 
denoted  by  U. 

Corresponding  to  each  mixed  strategy  for  Nature,  that  is 
each  probability  combination  of  columns,  there  is  a  linear  function 
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on  S  and  therefore  a  point  in  U.  If  Nature  plays  the  column 
c  =  (cl9  c  )  and  the  change  of  strategy  £=  (  cX]  ,  .  ..,  cK^) 

is  made  by  the  player,  the  gain  to  the  player  is  the  inner  product 

C* 6  =  C,c*,  +  •••  +  Cmcxm. 

This  product  defines  the  linear  function  on  corresponding  to 

this  strategy  of  Nature.  The  set  of  all  points  which  correspond 
to  mixed  strategies  for  Nature  forms  a  convex  set  NCU .  It  seems 
reasonable  that  the  function  we  should  construct  should  be  a 
point  of  N  so  that  the  solution  will  be  a  Bayes  solution.  This 
condition  is  equivalent  to  the  following. 

3.2  For  any  pair  of  rows  or  probability  mixtures  of  rows  r 
and  r* ,  the  linear  function  u  satisfies 

T 

u(r-r')  <  Max  (r.-r.) 

j  I  J 

where  j  runs  over  all  columns. 

In  particular  if  each  component  of  r  is  less  than  the  cor¬ 
responding  component  of  r'  then  u(r)  <  u(r'),  which  is  just 
axiom  1.4. 

In  this  context  the  problem  of  choosing  a  decision  criteria 
becomes  simply  the  problem  of  defining  a  unique  point  in  each  con¬ 
vex  set  N  in  some  suitable  way.  Since  we  wish  to  assume  some 
type  of  average  behavior  for  Nature,  this  point  may  be  considered 
as  a  center  for  N.  The  Laplace  criterion  is  that  obtained  by 
taking  the  center  of  gravity  of  the  vertices  of  N,  the  weight  of 
each  vertex  being  the  number  of  times  the  corresponding  column 
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appeared  in  the  matrix.  An  obvious  modification  of  this  would  be 
to  take  the  center  of  gravity  of  the  convex  set  N;  but  this  is 
unfortunately  discontinuous  when  N  changes  dimension. 

One  especially  simple  case  occurs  when  N  is  a  line  segment. 
Any  reasonable  definition  for  center  will  then  specify  the  midpoint 
of  N.  This  occurs  in  two  cases.  If  the  original  matrix  has  two 
columns,  then  these  columns  generate  a  line  segment,  the  center  of 
which  gives  the  Laplace  solution.  If,  on  the  other  hand,  the 
matrix  has  two  rows,  then  the  space  of  linear  functions,  and  there¬ 
fore  N  will  again  be  one  dimensional.  The  midpoint  in  this  case 


corresponds  to  the  following  rule. 


3.1  In  the  matrix 


choose  the  first  or  second 


row  according  as  Max  (a.-b.)  is  greater  than  or  less  than 

j  J  J 

Max  (b.-a.).  In  case  equality  holds,  choose  either  one  or  any 

j  3  3 

mixture . 


This  may  also  be  formulated  as  follows.  Assume  that  Nature 
will  play  that  column  for  which  the  player’s  decision  makes  the 
most  difference. 


One  possible  procedure  for  defining  such  a  center  for  N  will 
be  given.  Many  others  are  possible,  however,  and  I  have  no  strong 
justification  for  this  one.  It  is  motivated  by  the  following  idea. 
If  a  decision  procedure  of  this  type  cannot  satisfy  axiom  1.2,  we 


can  at  least  require  that  it  come  as  close  as  is  possible  to  satis¬ 
fying  it.  In  particular  we  can  require  that  for  every  two  row  sub¬ 
matrix;,  the  utility  of  changing  from  one  row  to  another  should  be 
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as  close  as  possible  to  the  utility  of  making  this  change  in  the 
full  matrix.  For  utility  function  u£N  the  utility  of  making 
the  change  $  is  u(  S').  In  the  two  row  submatrix,  the  utility 
of  changing  from  r  =  (a-,  ,  . ..,  afi)  to  r’  =  (bj  ,  bn)  ,  using 

the  midpoint  solution  just  mentioned,  is 

f  (  6  )  =  \  Max  (aj~bj)  +  \  Min  (a^.-bj )  . 

We  wish  therefore  to  find  which  u6N  can  be  considered  as  the 
best  approximation  to  f. 

Now  since  f  is  defined  in  terms  of  the  operations  Max  and 
Kin  applied  to  a  finite  number  of  linear  functions  over  Z\ , 
there  is  some  polyhedral  decomposition  of  such  that  the  function 

f,  and  therefore  u-f,  is  linear  on  each  polyhedron  of  the  decompo¬ 
sition.  It  is  therefore  sufficient  to  know  the  values  of  u-f  on 
the  vertices  of  these  polyhedrons  in  order  to  tell  how  well  u 
approximates  f.  Specify  the  function  u  by  the  following  three 
requirements . 

3.2  The  maximum  value  of  | u— f |  shall  be  as  small  as  possible. 

3.3  The  set  of  vertices  on  which  this  minimax  value  is  attained 
shall  be  as  small  as  possible. 

This  last  condition  is  meaningful  since,  if  u,  and  u2  attain 

the  minimax  value  on  different  sets,  then  attains  this  value 

2 

at  most  on  the  intersection  of  these  two  sets.  Therefore  by  aver¬ 
aging  a  finite  number  of  such  functions  we  obtain  a  function  which 
attains  this  value  on  a  smallest  possible  set  of  vertices.  The  set 
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of  u6K  which  satisfy  these  two  conditions  forms  a  convex  set, 
whose  closure  may  be  denoted  by  Let  Vt  denote  the  set  of 

vertices  at  which  these  functions  take  on  values  less  than  this 
maximum.  We  have  thus  proceeded  from  the  set  N=N0  and  the  set 
v0  of  all  vertices  to  smaller  sets  having  the  same  properties. 

We  now  proceed  inductively. 

3.4  Given  sets  and  v^  take  the  set  I\M+^  of  limit 

points  of  functions  satisfying  3.2  and  3.3  and  the  set  vj_+]_  of 
vertices  where  the  maximum  value  need  not  be  attained.  Repeat 
until  vr+^  Is  vacuous.  The  set  Nr+-^  will  then  consist  of  a 
single  point  which  is  the  required  function  u. 

The  criterion  which  is  specified  by  this  rule  satisfies  all 
of  the  axioms  of  section  II,  and  all  except  1.2  of  section  I.  The 
following  simple  example  will  be  worked  out  to  illustrate  the  rule. 


If  an  element  of  is  represented  by  a  triple  (  ^ )  ,  then 

the  vertices  of  /\  (which  is  a  hexagon)  are  (1,— 1,1) .  The  func¬ 
tion  f  is  derived  from  the  three  functions 

5ck+//?+4<T  cK  +  4^/d5  +  fr  +  z^S  +  4  <r 


by  the  operations  Max  and  Min.  It  is  therefore  nonlinear  whenever 
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two  of  these  functions  are  equal.  This  defines  the  three  lines 

5‘=<+//5,  +  4r=^  +  4/d1  +  f  or  4^  +  3  ^  =  ^4^, 

2^'  =  ^/5  ,  and.  2  ^  +  3  =  2/^. 

The  vertices  of  the  polyhedrons  formed  by  these  lines  and  Z\  , 
other  than  (0,0,0)  and  the  six  already  mentioned  are  (-  ^,1) , 

(|^,-1),  (^,§,-1),  (-3,-fa),  (1,-  | |)  ,  (-1,-|,|).  Listing 

one  vertex  from  each  pair  consisting  of  a  vertex  and  its  negative, 
we  have  the  following  table. 


Vertex  cf 

Max  c  •  cT 
c 

Min  c  •  cf 
c 

f_ 

(1,-1,0) 

4 

-3 

1 

2 

(1,0,-1) 

1 

-1 

0 

(0,1,— 1) 

3 

-3 

0 

(-  7,1) 

i7 

_  2 

7 

-1 

14 

(1  2  _1} 
v  3  >  3  >  x  * 

2 

_  1 

3 

1 

5 

(1  _  I  _  k\ 

1  ’  5’  5 

.  2 

5 

1 

2 

If  ul0*,^,  X~)  =  a  +  b^/^1,  then  the  vertices  of  N  are  the 

three  points  (a,b)  =  (1,-3),  (0,3),  and  (-1,-2).  The  point 

(a,b)  =  irr t)  is  in  the  triangle  N  and  is  a  best  approxima¬ 
te  3 

tion  to  f,  using  3.2  alone.  The  values  of  u  are  shown  in  the 


following  table 


RM-679 


-15- 


f(  cf  )  u(  <f  )  I  f-u  1 


1 

2 

0 


0 

_1 


14 

1 


5 


1 

2 


2 

5 

X 

15 

1 

3 

1 

105 

1 

"  5 

_2_ 

15 


_1 

10 

1_ 

15 

1 

3 

11 

30 

11 

30 

11 

30 


Using  this  function  u  as  utility  function,  we  find  that  it  is 
worth  to  switch  from  row  3  to  1;  and  worth  to  switch 

from  2  to  1.  The  choice  set  is  therefore  row  1. 

I  have  not  computed  the  example  of  section  II  in  detail,  but 
it  seems  that  the  solution  is  identical  with  the  Laplace  solution 
in  this  case.  The  function  u  depends  on  k  in  a  fairly  compli¬ 
cated  way,  however. 


§IV.  n- person  Games. 

Consider  a  game  of  the  following  type.  Each  of  n  persons 
makes  a  move  s.  out  of  a  set  of  possible  moves  S . ,  i  =  1,  ...,  n. 

l  l 

Simultaneously,  nature  makes  a  move  s0  out  of  a  set  of  possible 
moves  S0 .  The  payoff  P^(s0,s1,  ...,  sn)  is  defined  for 
i  =  1,  ...,  n.  As  an  example  of  this  consider  a  game  in  which 
the  players  have  incomplete  information  as  to  the  rules.  This  may 
be  treated  as  a  case  in  which  nature  chooses  a  particular  set  of 
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rules  s0  out  of  a  set  of  possibilities. 

Suppose  that  each  player  has  some  criterion  which  he  believes 
in  for  games  against  nature.  Then  if  each  player  except  i  has 
made  a  choice  of  strategy  beforehand,  the  ifc^  player  is  playing 
a  one  person  game  against  nature,  and  can  apply  his  criterion.  By 
an  equilibrium  point  of  such  a  game  will  be  meant  a  set  of  strategies 
s,  ,  ...,  sn  possessing  the  following  property.  For  each  i,  if 
players  1,  ...,  i-1,  1+1,  .n  hold  their  strategies  fixed, 

then  according  to  player  if s  criterion  s^  is  one  of  the  optimal 
strategies  in  the  resulting  game  between  player  i  and  nature. 

This  is  a  generalization  of  the  definition  of  equilibrium  point 
for  ordinary  n- person  games  which  was  given  by  John  Nash. 

Theorem.  If  each  player  has  a  criterion  satisfying  axioms 
2.1,  2.2,  and  2.3  then  there  exists  at  least  one  equilibrium  point. 

The  proof  by  the  Kakutani  fixed  point  theorem  follows  Nash's 
proof. 

In  most  applications  different  players  will  have  different 
amounts  of  information  as  to  what  nature  will  do.  The  game  which 
we  have  considered  does  not  take  this  into  account  as  it  stands; 
but  this  game  can  be  considered  as  a  normal  form  for  a  much  wider 
class  of  games.  This  reduction  to  normal  form  is  illustrated  by 
some  of  the  examples  which  follow. 

We  will  consider  variations  in  the  following  game  which  was 
suggested  by  E.  W.  Paxson .  It  is  modeled  after  a  "no  limit”  poker 
game.  Players  I  and  II  have  resources  M  >  0  and  N  >  0  respec¬ 
tively.  Each  knows  his  own  resources  but  may  have  incomplete 
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information  as  to  his  opponent's.  Each  has  a  choice  of  matching 
or  not  matching.  If  they  match,  the  one  with  higher  resources 
wins  the  resources  of  the  other.  In  case  of  a  tie,  no  exchange 
is  made.  If  one  player  fails  to  match,  he  is  penalized  half  his 
resources,  and  the  other  player  collects  this  amount.  If  both 
fail  to  match,  each  is  penalized  half  his  resources  and  no  one 
collects.  If  0  denotes  matching  and  1  denotes  not  matching, 
we  have  the  following  matrix. 


y  =  0  y  =  1 


N, -N  if  M >  N 

O, 0  if  M  =  N 

-M,M  if  M <  N 

N  N 

2  >  2 

M  M  M  N 

2»2  2  ’  2 

. . . .  -  . .  . 

The  equilibrium  points,  under  the  assumption  that  both  players  use 
the  Savage  criterion,  are  given  below. 

4.1  Each  player  knows  his  own  resources  but  has  no  information 
about  those  of  his  opponent.  In  this  case  there  are  two  equilibrium 
points.  One  at  x=0 ,  y-^  and  the  symmetric  one  at  x=^,  y=0. 

This  non-unique  answer  seems  to  be  caused  by  the  unboundedness  of 
the  set  of  possible  resources,  rather  than  by  the  non-zero  sum  nature 
of  the  matrix.  This  is  illustrated  by  the  following  case. 

4.2  Each  player  is  told  only  his  own  resources,  but  both  are 
given  the  information  that  0  <  M  <  1,  0  <  N  <  1.  There  is  now  a 
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unique  equilibrium  point  at 


x  =  Minf  .1569,  .5524  + 


.0262  _  t0246-H.040M-1.019M^  1 

M  SM  I 


where  y  is  the  same  function  of  N.  The  graph  of  this  function 
looks  roughly  as  follows. 


The  motivation  for  the  greater  tendency  to  match  when  the  resources 
become  very  small,  is  simply  that  there  is  very  little  to  lose  in 
this  case  and  a  great  deal  to  gain  in  case  the  opponent  passes  with 
larger  resources. 

4.3  The  resources  are  again  allowed  to  be  arbitrarily  large. 
Each  player  is  told  his  own  resources  and  given  an  upper  and  lower 
bound  for  the  resources  of  his  opponent.  In  other  words  Nature 
chooses  the  six  numbers 

0  <  N’  <  N  <  N"  and  0  <  M»  <  K  <  M" . 

Player  I  is  told  M , N ' ,  and  N".  Player  II  is  told  N,M»,  and  M". 
The  matrix  is  as  before.  In  this  case  there  is  again  a  unique 
equilibrium  point  in  which  the  two  players  have  the  same  strategy. 
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This  strategy  is  given  by 

0  for  N"  <  M 
^  for  N’  <  M  <  N"  <  14M 

X  =  ''  N^2M  for  M  -  N'  and  2M  -  N" 

or  for  14M  <  N" 

1  for  M  <  N '  and  N"  <  2M 

M 

A  graph  of  x  as  a  function  of  follows.  The  upper  branch  is 

for  N’  >  M  and  the  lower  branch  for  N'  <  M. 


The  solutions  to  this  example  by  the  criteria  of  section  III 
are  much  simpler.  I  do  not  know  how  to  extend  this  criterion  to 
infinite  games  in  general.  For  these  particular  examples,  however, 
since  there  are  only  two  alternatives  for  the  player,  the  solution 
may  be  obtained  from  3.1. 

For  the  games  4.1  and  4.2  the  solution  is  to  match  (play  x=0) 


in  all  cases.  For  4.3  the  solution  is  to  match  whenever  N’  <  M, 
that  is,  whenever  it  is  possible  that  N  <  M,  and  to  pass  otherwise. 
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APPENDIX 

I.  Proofs  of  assertions  in  section  1. 

1.10  Suppose  (P1P2P3  *•*  Pn)  >  (P2P1P3  •**  pn) •  Interchanging 
the  first  two  columns  we  have  (p2PiP3  ***  pn)  >  (P1P2P3  **•  pn) 
which  is  a  contradiction.  Since  any  permutation  of  the  elements  can 
be  achieved  by  successive  interchanges  of  pairs  of  elements,  the 
conclusion  follows. 


1.11  Suppose  the  average  of  the  elements  of  r  equals  the 
average  of  the  elements  of  r' .  Alternately  perform  the  following 
operations  on  the  two  row  submatrix  formed  by  r  and  rT . 

a)  Permute  the  elements  of  each  row  so  that  they  are  in 
order  of  increasing  size. 

b)  Subtract  from  each  column  the  minimum  of  its  element. 
After  a  finite  number  of  steps,  all  of  the  components  of  the  matrix 
will  be  zero,  and  therefore  r~rT. 

It  follows  by  1.4  that  if  the  average  of  the  elements  of  r  is 
greater  than  the  average  of  the  elements  of  r' ,  then  r  >  r' . 

1.12  It  is  sufficient  to  show  that  if  m  <  P^  <  M  then 

(m,m,  ...,  m,M)  is  equivalent  to  (m,?,,  . ..,  P  2,J4).  Alternately 
applying  1.10  and  1.6  we  have  the  following  sequence  of  pairs  of 
equivalent  rows. 
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■  » 


M 

m  )  _ > 

f  M  M  - 

m  \ 

m 

M  J 

l  m  m 

M  J 

m 

M  M'\ 

P^ 

m 

m  Mi" 

ra 

m 


n-2 


I*"! 

ra 


The  next  to  the  last  column  may  be  removed  by  reversing  all  but  the 
last  step. 


1.13  By  1.12  it  is  sufficient  to  consider  pairs  (M^,hk) 

satisfying  >  m^  in  place  of  rows.  Let  be  the  supremum 

of  ^  such  that 

(  ^  M.  +  (l-^Jm.  ,  +  (l-^)m.  )  <  (K.  ,ra.  )  and  0  <  <=*■<  1. 

By  axiom  1.7  the  ^  which  is  obtained  is  independent  of  i 
providing  only  that  Mi  >  rru.  Denote  it  by  By  axiom  l.g, 

(  c*  M  +  ( l-  c*. )  m  t  &  K  +  ( 1-  pk  )  ^  ( M  ,171). 

It  follows  by  axiom  1.4  that  the  criterion  is  that  of  Hurwicz. 

1.14  Again  consider  pairs  (M,m).  In  the  game  given  by  the 
matrix 

f  M  m  m'N 

y  m  M  m  J  ’ 

since  the  choice  set  is  symmetric  and  convex  it  must  contain  the 
average  of  the  two  rows  which  is 


M+tn  M+m 
2  ’  2  ’  m 
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Therefore  (M,m)  .  By  repeated  application  of  this  rule 

together  with  the  axiom  of  continuity  we  find  that  (M,m)  < 

It  follows  by  axiom  1.4  and  continuity  that  (M,m)  >  (M’,mT)  if 
and  only  if  m  >  m' ,  which  is  the  minimax  criterion. 


II.  Derivation  of  Hurwicz  solution  for  the  given  matrix. 

We  wish  to  choose  the  a^  so  as  to  maximize 

Max  (a2  "**  a3  +  a4  j  a2  +  ka4 ,  ai  +  ka2  ,  a  ^  a  3  ^  a4 ) 

+  (1-^)  Min  (a2  +  a3  +  a4,  a2  +  ka4 ,  aj  +  ka2 ,  a}  +  a3  +  a  4) 

=  c<Max  (a2  +  a3  +  a4 ,  a!  +  a3  +  a4)  +  (l-o<)  Min  (a2  +  ka4 ,  a1  +  ka3). 


This  maximum  can  only  be  achieved  if  a2  +  ka4  =  a5  +  ka2  providing 
that  *=<  <  1  .  Assume  for  example  that  a2  >  a,  .  This  implies 
that  a4  =  0  for  otherwise  by  decreasing  the  value  of  a4  and 
increasing  a2  and  a3  we  could  increase  the  value  of  the  expres¬ 
sion.  The  equations 

a2  =  St  +  ka3 

a1  +  a2  +  a3  =  0  yield 


a2 


aj  +k-ka] 
1+k 


This  together  with  0  <  a?  <  a2  determines  a  line  segment  one  end 
of  which  is  (^,^,0,0)  and  the  other  end  of  which  is  > i+k >i+k *0)  . 

The  expression  is  a  linear  function  on  this  segment  and  therefore 
takes  its  maximum  either  at  one  end  or  on  the  entire  segment,  as 
is  tabulated  in  section  II. 
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